Development of HMM-based Malay Text-to-Speech System
نویسندگان
چکیده
This paper presents the development of a hidden Markov model (HMM)-based Malay text-to-speech (TTS) system. To our knowledge, this is the first report on the development of the HMM-based speech synthesis system for the Malay language. In this paper, We first discuss the Malay speech characteristics, specifically, on Malay phonological system and syllable structure. In the Malay phonological system, 37 phonemes are adopted as the phonemic representations. Then, we describe a HMMbased TTS framework and language specific knowledge such as phonological, linguistic information, and utterance structure, which is used in context dependent continuous HMM and treebased clustering. After that, we report the development of Malay TTS corpora. Finally, a male and a female HMM-based Malay TTS systems are developed and evaluated. We further conduct listening test based on the Mean Opinion Score (MOS), and the results show that the developed HMM-based Malay TTS system can generate speech with acceptable quality in terms of naturalness and intelligibility.
منابع مشابه
A Cross-Lingual Approach to the Development of an HMM-Based Speech Synthesis System for Malay
This research reports the development of an HMM-based speech synthesis system for Malay, which is an underresourced language with few resources including recorded speech and segmental labels. We propose the cross-lingual use of resources for developing a Malay HMM-based speech synthesis system. We used the Festival English speech synthesis system to generate time-aligned phone transcriptions fo...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملStatistical Parametric Evaluation on New Corpus Design for Malay Speech Articulation Disorder Early Diagnosis
Corresponding Author: Tan Tian Swee Medical Implant Technology Group (MediTEG), Cardiovascular Engineering Center, Material Manufacturing Research Alliance (MMRA), Faculty of Biosciences and Medical Engineering, Universiti Teknologi Malaysia, Malaysia Email: [email protected] Abstract: Speech-to-Text or always been known as speech recognition plays an important role nowadays especially...
متن کاملSpeech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملIsolated Malay Digit Recognition Using Pattern Recognition Fusion of Dynamic Time Warping and Hidden Markov Models
This paper is presents a pattern recognition fusion method for isolated Malay digit recognition using Dynamic Time Warping (DTW) and Hidden Markov Model (HMM). The aim of the project is to increase the accuracy percentage of Malay speech recognition. This study proposes an algorithm for pattern recognition fusion of the recognition models. The endpoint detection, framing, normalization, Mel Fre...
متن کامل